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Abstract. Long-range correlations are found in symbolic sequences from human 
language, music and DNA. Determining the span of correlations in dolphin whistle 
sequences is crucial for shedding light on their communicative complexity. Dolphin 
whistles share various statistical properties with human words, i.e. Zipf 's law for word 
frequencies (namely that the probability of the ith most frequent word of a text is about 
and a parallel of the tendency of more frequent words to have more meanings. The 
finding of Zipf 's law for word frequencies in dolphin whistles has been the topic of an 
intense debate on its implications. One of the major arguments against the relevance of 
Zipf 's law in dolphin whistles is that is not possible to distinguish the outcome of a die 
rolling experiment from that of a linguistic or communicative source producing Zipf 's 
law for word frequencies. Here we show that statistically significant whistle- whistle 
correlations extend back to the 2nd previous whistle in the sequence using a global 
randomization test and to the 4th previous whistle using a local randomization test. 
None of these correlations are expected by a die rolling experiment and other simple 
explanation of Zipf's law for word frequencies such as Simon's model that produce 
sequences of unpredictable elements. 
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1. Introduction 

Long-range correlations liave been reported in different Icinds of symbolic sequences: 
human language [B [21 El H], DNA [5] and music [HI H]. A few studies have studied 
the span of correlations in sequences of behavior produced by other species [71 El Ej- 
Rather long-correlations (extending back at least to the 7th previous element) have 
reported in sequences of dolphin surface behavioral patterns [7] . A preliminary analysis 
of constraints in sequences of dolphin whistles was performed in Ref. [8] but strong 
conclusions were not reached due to the small size of the dataset. Determining the 
actual span of correlations in dolphin sequences is crucial for shedding light on the 
communicative complexity of dolphins whistles. 

Various similarities between human words and dolphin whistles have been reported. 
A parallel of Zipf 's law of meaning distribution, the tendency or more frequent words 
to have more meanings [10], has been found in dolphins whistles [11]. Zipf's law for 
word frequencies, namely, the probability of the ith most frequent word of a text, 
is ~ where a is the exponent of the law [10], has also been found in dolphin 
whistles [8]. However, the finding remains controversial because it has been argued that 
simply rolling a die could explain the presence of the law in dolphin whistles [I2]. The 
experiment consists of generating a sequence of faces by rolling a die. One of the faces 
of the die plays the role of a word delimiter. For instance, a die of six faces could 
produce 1, 5, 2, 6, 3, 2, 4, 3, 5, 6... Treating 6 as a pseudo-word delimiter, the previous 
sequence of faces becomes the sequence of pseudo-words 152, 32435, .... The experiment 
is an abstraction of the popular monkey typing experiment, that consists of typing at 
random on a keyboard (the face that plays the role of the word delimiter is the space 
and the other faces are letters) [I3]. Die rolling and monkey typing have been argued 
to explain or mimic Zipf's law for word frequencies in human words [13] and dolphin 
whistles [12]. Another die rolling experiment, where the probability of the ith face is 
the probability of the ith most frequent word, has been proposed [H]. 

The hypothesis of die rolling as an explanation for Zipf's law for word frequencies 
in human language in other species can be tested in at least two different ways: 

(i) By comparing the actual distribution of 'word' frequencies with the one that is 
actually produced by the die rolling process fT5^ . Concerning the die rolling 
experiment of Ref. [12], the parameters of the model that provide a satisfactory 
fit to actual word frequencies according to a statistically rigorous test, are indeed 
unknown [15] , in spite of the many previous claims about the good fit of the model 
using qualitative arguments [131 [12] • Concerning the die rolling experiment or Ref. 
[H] , the model is able to trivially provide a perfect fit to any theoretical or empirical 
discrete distribution, being Zipf's law for word frequencies a particular case. 

(ii) By comparing the statistical properties of the sequence of words produced with those 
of the sequence that is produced by the die rolling process. The rolling experiments of 
Refs. [121 [H] produce a sequence of independent 'words' in the sense that elements 
that have already been produced carry no information about the next element. In 
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contrast human language, shows long range correlations in texts using words (e.g. 
[21 [3]) or letters (e.g., [H [16]) as units of the sequence. 

Here we will follow the second track for dolphin whistles. The aim of the present article 
is to determine the span of correlations in dolphin whistle sequences and evaluate the 
suitability of die rolling [121 Ej- The challenge of the analysis is facing the statistical 
problems arising from the rather small size of the dataset of Ref. [8]. The danger of 
undersampling in the context of dolphins whistles has already been discusssed [8]. The 
next section presents the information theoretic measure approach that will be used to 
study correlations in dolphin whistle sequences in that dataset. 



2. Mutual information 



We consider pairs of whistles in a sequence and three related random variables: X for 
the whistle that is the 1st member of the pair, Y for the whistle that is the second 
member of the pair and D for their distance. We adopt the convention that consecutive 
elements are at distance 1, elements separated by one element are at distance 2, and 
so on...[T7]. Given a collection of sequences of whistles, p{X = x,Y = y\D = d) is 
defined as the probability that whistle x is followed by whistle y knowing that they are 
at distance d from each other and d^ax is defined as the maximum distance considered in 
the analysis (thus 1 < d < dmax)- Given a certain distance d, the marginal conditional 
probabilities are defined as 

p{X = x\D = d) = Y,P{X = x,Y = y\D = d) 

y 

p(Y = y\D = d) = Y,P{X = x,Y = y\D = d). 

X 

I{X; Y\D = d), the conditional mutual information between X and Y given a concrete 
distance d, is defined as [ir 



I{X;Y\D = d) = Y.p{X = x,Y = y\D = d)logq{X = x,Y = y\D = d), (l) 

x,y 

where 

V in ^^ p{X = x,Y = y\D = d) 

qiX = X,Y = V\U = d) = — ; : — ; ; -. 

' yi ^ p{X = x\D = d)p{Y = y\D = d) 

I{X] Y\D = d) has been used to study long-range correlations in DNA, texts and music 

[3111. 

f{X = x^Y = y\D = d) is defined as the number of times that x has been followed 
by y at distance d. The marginal conditional frequencies are defined as 

f^X = x\D = d)= Y,f{X = x,Y = y\D = d) (2) 

y 

f{Y = y\D = d) = Y,f{X = x,Y = y\D = d), (3) 

X 

and the total number of pairs at distance d is defined as 

F{d) = J2f{X = x\D = d)=Y.f{Y = y\D = d). 
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In a finite collection of sequences, p{X = x,Y = y\D = d), p{X = x\D = d) and 
p{Y = y\D = d) are relative frequencies, i.e. 

(If V \n ^^ f{X = x,Y = y\D = d) 
p{X = x,Y = y\D = d) = (4) 

n f(X = x\D = d) 
p(X = x\D = d) = ^ j;^^ (5) 

fv \n M f{Y = y\D = d) 
piY = y\D = d) = . (6) 

In a real collection of sequences, p{X = x,Y = y\D = d), p{X = x\D = d) and 
p{Y = y\D = d) are estimated from these relative frequencies. Applying Eqs. HI Eland 
|6] to Eq. [H yields the sample mutual information between X and Y given a concrete 
distance d, 



h{X-Y\D = d) = \ogF{d) + p'^^^ ^^ (7) 



where 



ai = ^ /(X = X, F = y\D = d) log/(X = x,Y = y\D = d) 

x,y 

a2=Y. = = ^) log /(^ = = ^) 

X 

^3 = E /(^ = y\D = d) log /(^ = y\D = d). 

y 

Next two useful properties of /^(X; Y\D = d) are presented: 

(i) If marginal conditional frequencies at distance d are boolean, i.e. f{X = x\D = 
d) = f{Y = y\D = d) e {0, 1} for any x and y then Y\D = d) = logF(d). 
To see it, notice that f{X = x\D = d) = f{Y = y\D = d) = 1 implies, thanks to 
Eqs. [2] and [3], that the joint frequencies are all boolean, i.e. f{X = x,Y = y\D = 
d) G {0, 1} for any x and y. According to Eq. [TJ the fact that both marginal and 
joint conditional frequencies do not exceed one yields ai = = o"3 = and thus 
Is{X] Y\D = d) = logF{d) as we wanted to prove. 

(ii) If all whistles at cooccurring at distance d are identical (only one whistle type has 
non-zero frequency), then (Xi = (72 = (T3 = F{d) log F{d) and then, according to Eq. 
m Is{X;Y\D = d) = 0. 



3. Methods 



We reused the collection of whistle sequences employed to study Zipf 's law in dolphin 
whistles [8]. A summary of the elementary statistical properties of the sequences is 
provided in Table [H Sequences of length smaller than 2 where filtered out. 



3.1. Distances where correlations are significant 

For each dolphin in the dataset, his/her collection of sequences was analyzed to extract 
a list of distances in the interval [2,d'^^^] at which Is{X;Y\D = d) is statistically 
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significant at a significance level of a = 0.05. For each dolphin and each distance, we 
used a Monte Carlo procedure for estimating a p-value indicating the probability that 
the value of /s(X; Y\D = d) from a randomized version of the data /^(X; Y\D = d) is 
at least as large as that of the original collection of sequences: 

(i) -R = 10^ randomized versions of the original data were generated. 

(ii) R>, the number of times the value of /^(X; Y\D = d) is at least as large as that of 
the original data was calculated for each d G [1, df^^^^] and the p- value was estimated 
as R>/R. 

(iii) For each dolphin, all distances d such that value = R>/R < a were added to the 
list of distances. 

3.2. Upper bounds for the span of correlations 

We say that Is{X]Y\D = d) is constant if Is{X;Y\D = d) is the same for any 
randomization of data upon which Is{X]Y\D = d) is computed. We consider two 
situations in which /^(X; Y\D = d) cannot be significantly high: (a) /s(X; Y\D = d) is 
constant in the sense above or (b) /s(X; Y; D = d) = as is actually the minimum of 
mutual information [18]. Accordingly, we consider three different ways of defining dmax'- 

• (imax) defined through the number of pairs of elements at distance ci in a sequence 
of length /, i.e. 7i{d, I) = I — d. The maximum distance d that can be considered 
in a collection of sequences where the longest sequence has length Imax is obtained 

from 71 {dmax, Imax) > 1, which gives = Imax - 1- 

• ^maxj defined as the smallest distance at which /s(X; Y\D = d) is constant in the 
sense above for any d G (c^max' '^maz]- 

• (imax; defined as the smallest distance beyond which Is{X]Y\D = d) cannot 
be significantly high for d G {dl^ax^ dP^ax]- Notice that rf^^^^ < because 
/s(X; Y\D = d) cannot be significantly high if Is{X; Y\D = d) is constant. 

The interest of these definitions of dmax is two- fold. First, bounding a priori the span of 
correlations between whistles. Second, reducing the computational cost of evaluating the 
significance of Is{X; Y\D = d) a.t distances where the result of the test is straightforward. 
Distances greater than (i^^^ can be discarded. 

We did not check if Is{X] Y\D = d) = using Eq. Has this is problematic due to 
finite numerical precision for real numbers. Eq. [1] indicates that Is{X; Y\D = d) = ii 
and only if q{X = x;Y = y\D = d) = 1 for any x and y such that p{X = x;Y = 
y\D = d) > 0. Applying Eqs. HI [5] and [6], it is easy to see that the condition 
q{X = x;Y = y\D = d) = 1 is equivalent to a more numerically convenient condition, 
i.e. F{d)f{X = x,Y = y\D = d) = f{X = x\D = d)f{Y = y\D = d). 

3. 3. Kinds of randomization 

Two kind of randomizations of the data upon /^(X;!^!/} = d) is calculated were 
considered: global and local randomization. A precise calculation of d]^^^ and d^^^ 
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Table 1. Summary of the elementary statistical properties of the collections of 
sequences of each dolphin. For each dolphin, the following information is shown: 
dolphin's name, T, the total number of whistle types, V, the number of different 
whistle types, 5* (the number of sequences), (I) — T/S (the mean sequence length in 
whistle types), Imax (the maximum length). Dolphins are sorted decreasingly by T. 





T 


V 

V 


Q 


(I) 


7 

''max 


Liberty 


334 


31 


102 


3.27 


11 


Norman 


235 


39 


74 


3.18 


21 


Panama 


148 


42 


43 


3.44 


10 


Chelsea 


110 


12 


34 


3.24 


8 


Sadie 


110 


13 


33 


3.33 


8 


Stormy 


94 


12 


27 


3.48 


9 


Delphi 


65 


18 


24 


2.71 


7 


Neptune 


53 


9 


9 


5.89 


13 


Circe 


47 


4 


15 


3.13 


7 


Desmond 


40 


3 


7 


5.71 


10 


Sam 


39 


4 


12 


3.25 


13 


Tasha 


35 


9 


12 


2.92 


7 


Bayou 


27 


10 


8 


3.38 


8 


Terry 


21 


7 


5 


4.20 


7 


Schooner 


19 


4 


6 


3.17 


6 


ECB 


15 


5 


6 


2.50 


4 


Gordo 


6 


6 


2 


3.00 


4 



depends on the kind of randomization. 

3.3.1. Global randomization Here the data that is randomized is the whole collection 
of sequences. The randomization procedure consisted of copying all whistles in a vector 
of length 

s 

T = Y.h. 

1=1 

where U is the length of the ii\i sequence and S is the number of sequences of the 
collection. Then, every randomized version of the collection is obtained by generating 
a uniformly distributed random permutation of the vector [19] and cutting that vector 
in pieces of lengths /i, ...Is (cut always in this order) to produce the randomized 
collection of sequences. Notice that the randomization procedure preserves the sequence 
lengths and the frequencies of each whistle in the original collection of sequences. 
Concerning the computation of d]^^^, notice that constant /^(X; Y\D = d) with regard 
to any global randomization can occur in three circumstances: 

(i) If all the whistle types are hapax legomena, i.e. they occur only once in the 
whole collection as all the marginal conditional frequencies are boolean and thus 
I{X; Y\D = d) = \ogF{d), as it has been shown above, for any d G [1, c^max]- 
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(ii) If the repertoire size is one (there is only one whistle type of non-zero frequency) 
then all the whistles cooccurring at a certain distance are identical and thus 
I{X;Y\D = d) =0, as it has been shown above, for any G [1, d^^^]. 

(iii) When F{d) = 1, all the marginal conditional frequencies are boolean in this case 
and then I(X; Y\D = d) = log F{d) = log 1 = 0. 

These conditions lead to the following specific procedure for calculating d}^^^. Satisfying 
condition (i) (this is the case of the dolphin 'Gordo') or (ii) implies d^^^ = 0. If none of 
these two conditions is met, (imax is calculated by means of condition (iii). In order to 
warrant F{d) > 1, d}^^^ must be ci^„^ if there is more than one sequence of maximum 
length and d}^^^ = d^^^ — 1 otherwise. To see it, notice that if the collection has Smax 
sequences of maximum length, then F{d]^^^) = SmaxT^{lmax ~ l^O — ^max and thus 
F{d) > 2 if and only if Smax > 2. 

3.3.2. Local randomization Here the data that is randomized are the pairs of whistles 
occurring at a certain distance d. The randomization procedure consisted of copying all 
the pairs of whistles occurring at distance d in a vector of length 2F{d). Then, every 
randomized version of the collection was obtained by generating a uniformly distributed 
random permutation of the vector [19] and then taking the ith and the (i + l)th whistles 
of the vector, with 1 < i < 2F{d), to form the jth pair of whistles of the randomized 
pairs of whistles, with 1 < j < F{d) and j = \i/2]. Notice that the randomization 
procedure preserves the frequencies of each whistle in the original pairs of whistles at 
distance d. 

Concerning the computation of rf^^a: local normalization notice that constant 
Is{X]Y\D = d) with regard to any local randomization for distance d, adds two new 
relevant conditions with regard to global randomization: 

(iv) When all whistle types are locally hapax legomena, i.e. all the whistle types 
forming the pairs at distance d occur only once in the ensemble of pairs at 
distance d. In that case, all marginal conditional frequencies are boolean and thus 
Is{X] Y] D = d) = F{d) as it has been shown above. 

(v) When the local repertoire has size one, i.e. all the pairs at distance d are made of 
a single whistle type. In that case, all Is{X; Y; D = d) = a.s it has been shown 
above. 

The procedure for calculating d^^^ under local normalization is the following. First, 
compute (imax with the same procedure used for global normalization. Decrease d^^^ 
while dm^^ > and at least one of the two following conditions is met at distance d^^^: 
(a) all whistle types are locally hapax legomena or (b) the local repertoire size is one. 

4. Results 

Table [2] shows the values of d where IsiX; Y\D = d) is significantly high in the dolphins 
from Ref. [8] at a significance level of 0.05 for global and local randomization, is 
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Table 2. Analysis of the span of correlation between whistles at a certain distance d 
using global and local randomization. For the collection of whistles sequences of each 
dolphin, the following information is shown: the dolphin's name, d^^'^ (the maximum 
distance according to different definitions) and the list of distances d S [l,'ima2;] such 
that Is{X;Y\D = d) is significantly high at a significance level of 0.05. Dolphins are 
sorted decreasingly by T. 



Global randomization Local randomization 



Name 


77iax 


d^ 

max 


d^ 

max 


Distances 


d^ 

max 


d^ 

max 


Distances 


Liberty 


10 


9 


7 




8 


7 


1, 4, 6, 7 


Norman 


20 


19 


15 


1, 2 


19 


15 


1,2, 3,4 


Panama 


9 


8 


7 


5 


8 


7 


1, 2, 3 


Chelsea 


7 


6 


3 


1 


4 


3 


1,2 


Sadie 


7 


6 


6 


1, 2 


6 


6 


1,2 


Stormy 


8 


7 


6 


1 


7 


6 


1,2,3 


Delphi 


6 


5 


5 




4 


4 


1 


Neptune 


12 


11 


7 


2 


11 


7 


1,2, 3,4 


Circe 


6 


5 


1 




1 


1 




Desmond 


9 


8 


3 


1 


8 


3 




Sam 


12 


11 


9 


7 


10 


9 




Tasha 


6 


5 


5 




5 


5 




Bayou 


7 


6 


6 




5 


5 


1,2 


Terry 


6 


5 


3 


1 


4 


3 


1 


Schooner 


5 


4 


2 


1 


4 


2 


1 


ECB 


3 


2 


2 




2 


2 




Gordo 


3 



















defined as the number of dolpliins for which Is{X]Y\D = d) is significantly high at a 
significance level a. Table [2] yields n^^^ = 7, n^'^^ = 3, rig'*^^ = n^'^^ = 1 for global 
randomization and n?'"^ = 11, n^'^^ = 7, 77,3°^ = 4, n"'"^ = 3, ng°^ = ny"^ = 1 for local 
randomization. 

Next we will perform a meta-analysis to determine when n^-^^ is significantly high 
because n^'^^ could be the outcome of false positives (type I statistical errors) from 
the test that determines if Is{X;Y\D = d) is significantly high for a certain dolphin. 
The p-value of a continuous statistic is known to be uniformly distributed under the 
null hypothesis [20]. In our case, Is{X]Y\D = d) is approximately continuous and the 
quality of the approximation is expected to increase with the size of the permutation 
space of the collection of sequences. This implies that nj^, the number of dolphins for 
which Is{X; Y\D = d) is significantly high at a significance level a follows approximately 
a binomial distribution with parameters A^^^ and a, where Nd is the total number of 
dolphins for whom Is{X;Y\D = d) can be significantly high according to the criteria 
above, and a = 0.05 is the significance level. Notice that the dolphin 'Gordo' is a 
special case because he does no have distances at which mutual information can be 
significantly high (he has d}^^^ = d^^^ = because all the whistles in his collection 
are hapax legomena; notice T = in Table [1]). Thus, A^^ is not simply the number of 
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Table 3. Summary of the test of the significance of ti^ '^^j ^^e number of dolphins for 
which Is{X;Y\D = d) is significantly high at a significance level of 0.05. Distances 
where n^ *'^ cannot be significantly high (as = is minimum) are excluded from 
the analyses and filled with — . Nd is the number of dolphins for whom Is{X;Y\D = d) 
can be significantly high. The p- values indicate the probability of reaching a value of 
71°"^ at least as high as the actual one simply by chance. The p-values were estimated 
by means of two different procedures: a binomial approximation and a Monte Carlo 
method, p- values were rounded to leave only one significant digit. 



d 




Global randomization 




Local 


randomization 


„0.05 
"■d 


Nd 


P- 


value 


„0.05 
"-d 


Nd 


P- 


value 


Binomial 


Numerical 


Binomial 


Numerical 


1 


7 


16 


6 X IQ-^ 


< 0.001 


11 


16 


2 X 10-" 


< 0.001 


2 


3 


15 


0.04 


0.02 


7 


15 


3 X 10-6 


< 0.001 


3 











4 


13 


0.003 


0.001 


4 











3 


10 


0.01 


0.009 


5 


1 


10 


0.4 


0.3 











6 











1 


7 


0.3 


0.2 


7 


1 


5 


0.2 


0.2 


1 


5 


0.2 


0.2 



dolphins in the dataset. 

A binomial test can be used to asses if is significantly large. The p-value of this 
test is 

p — value = 

d 

Concerning global randomization, a binomial test indicates that the number of dolphins 
showing a significant correlation is significantly high for d = 1 and d = 2 but not for 
d > 2 (Table |3]). Concerning local randomization, a binomial test indicates that the 
number of dolphins showing a significant correlation is significantly high for 1 < d < A 
but not for ci > 4 (Table [3]). 

The meta-analysis based upon a binomial test assumes that 0.05 is the probability 
of rejecting the null hypothesis by chance but indeed the accuracy of the assumption 
depends on the properties of the collection of whistle types. For instance, we have seen 
that the mutual information cannot be significantly high for the dolphin 'Gordo' and 
thus the probability of rejecting the null hypothesis by chance is thus for him. A Monte 
Carlo meta-analysis allows one to improve the approximate calculation of the p-value 
offered by Eq. [8] to some degree of numerical precision. Consider that a randomized 
ensemble of collections of sequences consists of a randomized version of the collection 
of each of the dolphins (excluding 'Gordo'). For each distance d, the p- value of is 
estimated by the proportion of times in randomized ensembles of collections that 
the value of is equal or greater than the value of of the original (non-randomized) 
ensemble. Rm = 1000 and R = 1000 were used. Concerning global randomization, 



X 



a^(l-a) 



Nd-x 



(8) 
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Table 4. Summary of the support for significance of each distance (based upon Table 
[3]). n^"^ is defined as the number of dolphins for which Is{X-, Y\D = d) is significantly 
high at a significance level of 0.05. For each kind of randomization (global or local) 
and meta-analysis (binomial approximation or Monte Carlo method) , the distances at 
which nj]'°^ is significantly high are indicated. 







Distances 






Global randomization Local randomization 


n0 05 > 

Meta-analysis 

Meta-analysis 


(binomial approx.) 
(Monte Carlo method) 


1,2,5,7 1,2,3,4,6,7 
1,2 1,2,3,4 
1,2 1,2,3,4 



one obtains that n^^^^ is significantly high for d = 1 and d = 2 but not for d > 2 
(Table El). Concerning local randomization, one obtains that ?7,° °^ is significantly high 
for I < d < 4 but not for d > 4 (Table [3]). To sum up, the conclusions of the two 
meta-analyses coincide from a qualitative point of view (Table S]). The binomial test 
provides a good enough approximation for both global and local randomization. 

From a global randomization perspective, correlations are short range: the presence 
of significant correlations at low distances [d = 2 and specially = 1) is unquestionable 
but significant correlations at higher distances could be false positives (Table H]). From a 
local randomization perspective, correlations extending back to the 4th back whistle type 
are unquestionable but farther significant correlations could be false positives (Table H]). 

5. Discussion 

We have demonstrated that, for the majority of individuals, a dolphin whistle carries (on 
average) a significant amount of information about the next whistles of the sequence 
(Tables [2] and [3]). Global randomization indicates that a whistle carries information 
about at least one of the next two whistles of the sequence whereas local randomization 
indicates that a whistle carries information about at least one of the next four whistles of 
the sequence (Table H]). This is a property that is inconsistent with die- rolling, where a 
pseudo-word carries no information at all about the next pseudo-words of the sequence. 
The fact that this is also a feature also shared by Simon's model for word frequencies 
|21] . questions the validity of a popular argument against the utility of Zipf's law for 
frequencies, namely that the law is of practically no help in assessing the complexity of 
a communication system because there are many ways of reproducing it [221 ISSl [IH [12] . 
Indeed, when the statistical properties of the sequence are taken into account, the 
number of candidate explanations drops down. The multiplicity of explanations for 
Zipf's law in a species such as dolphins depends on how many statistical features, 
besides Zipf's law for word frequencies, are used to break the tie between candidates. 
The big question that future research on dolphins whistles must address is: what is the 
communicative complexity of a system whose units (e.g., whistles types) are distributed 
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following Zipf's law for word frequencies [S], show a parallel of Zipf's law of meaning 
distribution [11] and form sequences with correlations that defy a simple explanation 
such as die rolling or Simon's model? We hope that our research stimulates further 
data collection to determine if the rather short range correlation discovered here are an 
intrinsic property of dolphin whistle communication or a consequence of the small size 
of our dataset. 
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